A walkthrough of why big data exists and how its technologies work together. We'll build a data pipeline from scratch, start to finish, using AWS EMR, Spark, Airflow, Presto, Hive, PySpark, Docker, and more!